ABSTRACT

Recently, researchers at the Naval Research Laboratory have developed the SWOrRD system for measuring two-dimensional
Raman spectra. The device illuminates the sample with a tunable ultraviolet laser at various wavelengths (210-300 nm)
and collects a single Raman spectrum at each laser wavelength. The single spectra are
combined to form a two-dimensional spectrum (laser wavelength by scattered wavenumber).

In this paper we introduce a novel method for the detection of known agents (‘targets’) within measured 2d spectra. Our
method is based on ‘linear mixed pixel’ techniques from hyperspectral imagery; in particular, we generalize the Adaptive
Subspace Detector (ASD) to a form suitable for SWOrRD samples. Our detector uses the individual laser runs to define
a set of points within wavenumber space; the set of points corresponding to a 2d spectrum defines a particular subspace
that contains each material. These subspaces are then used with ASD to identify targets. We include experimental
results using real-world data to illustrate the method.
1. INTRODUCTION

The Swept-Wavelength Optical resonance-Raman Device (SWOrRD) [1,2] is a new system that has been developed by
the Naval Research Laboratory to measure resonant Raman spectra over a variety of input laser wavelengths, ranging
from the deep ultraviolet (UV) to the visible. SWOrRD consists of a tunable laser that illuminates the sample under
study and a two-stage spectrometer for recording the Raman-scattered light that is emitted. The laser is a gain-switched
Ti:Sapphire oscillator that operates at 5 kHz and generates 18 ns TEM-00 pulses tunable from 700 - 940 nm in 1 nm
steps. Light from the oscillator is converted with barium borate crystals to either the third or fourth harmonic for output
from 210 - 280 nm, with a spectral width of approximately 4 cm$^{-1}$. Tuning the laser is synchronized with the angular
positions of the gratings in the spectrometer, and it takes less than one minute to tune from one wavelength to the next.
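
As a rough check on the wavelength figures above, the following sketch computes which harmonic order (third or fourth) of a fundamental in the 700 - 940 nm tuning range reaches a given output wavelength. It relies only on the fact that the n-th harmonic has wavelength equal to the fundamental divided by n; the function names are hypothetical and are not part of the SWOrRD control software.

```python
def harmonic_wavelength_nm(fundamental_nm: float, order: int) -> float:
    """Wavelength of the order-th harmonic: frequency multiplies by `order`,
    so the wavelength divides by it."""
    return fundamental_nm / order


def fundamentals_for_output(output_nm: float, lo: float = 700.0, hi: float = 940.0):
    """List (order, fundamental_nm) pairs within the tuning range [lo, hi]
    whose third or fourth harmonic equals output_nm."""
    options = []
    for order in (3, 4):
        fundamental = output_nm * order
        if lo <= fundamental <= hi:
            options.append((order, fundamental))
    return options


for out in (210.0, 234.0, 240.0, 280.0):
    print(out, fundamentals_for_output(out))
```

Under these assumptions, the short end of the 210 - 280 nm range is reachable only with the fourth harmonic and the long end only with the third, with a narrow overlap near 233 - 235 nm.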

By  varying  the  input  laser  wavelength,  SWOrRD  creates  a  two-dimensional  Raman  spectrum  by  ‘building  up’  a  series  of 
one-dimensional spectra, one for each laser input (Fig. 1). Any input energy-dependent resonances will appear as
‘bumps’  within  the  SWOrRD  spectrum.  This  combination  of  traditional  Raman  spectroscopy  with  resonance 
information  creates  a  novel  measurement  method  that  contains  much  more  information  about  a  given  sample  than 
previous  methods.  Our  goal  is  to  use  this  extra  information  in  order  to  develop  new  methods  for  identifying  desired 
materials  within  unknown  measured  samples. 

In this paper we introduce one method for doing so, based on the ‘linear mixing’ assumption common in signal and
hyperspectral image processing. This model assumes that the spectrum corresponding to a given sample that includes
several ‘pure’ components may be written (at least approximately) as a linear sum of the spectra of the individual
components. Over the last several years, a wide variety of different ‘target detection’ algorithms [3] have been
introduced in the literature to identify individual materials in a linear mixture. Our method is based on one such
algorithm, known as the Adaptive Subspace Detector (ASD). In very general terms, the ASD algorithm assumes that
given target and background spectra can be modeled as subspaces in a certain n-dimensional vector space; it then
decomposes an unknown sample into target / background components via linear algebraic projection operators. In short,
if the given sample contains a ‘large’ target component, it is assumed that the sample contains the target material as one of
the pure components in the mixture; if not, then the material is assumed to be absent.

Fig. 1. SWOrRD spectrum of acetonitrile. Top: Raman spectrum from a single input illumination (240 nm). Bottom:
The full 2d SWOrRD spectrum. The x axis corresponds to wavenumber; the y axis corresponds to the input laser wavelength.


In  the  next  section,  we  present  a  general  overview  of  the  ASD  algorithm,  and  discuss  how  we  modified  this  algorithm  to 
work  with  SWOrRD  spectra.  This  is  followed  by  the  results  of  applying  our  modified  algorithm  to  a  variety  of  liquid 
and  solid  chemical  mixtures. 


2.  TARGET  DETECTION 

In  this  section  we  present  the  details  of  our  target  detection  algorithm.  We  begin  this  section  with  a  brief  overview  of  the 
linear  mixing  assumption,  and  how  this  assumption  may  be  used  to  find  targets  in  mixed  spectra.  This  is  followed  by  a 
general  discussion  of  the  Adaptive  Subspace  Detector  (ASD),  which  forms  the  basis  of  our  detection  algorithm.  We 
conclude by showing how we modified the ASD algorithm to work with the SWOrRD data.


2.1  Linear  Mixing 

The  main  assumption  underlying  our  detection  algorithm  is  the  Linear  Mixing  Model  (LMM);  intuitively,  the  linear 
mixing  assumption  is  that  a  measured  sample  containing  multiple  materials  can  be  written  (at  least  approximately)  as  a 
weighted linear combination of the spectra of each of the components. In more formal terms, we begin by assuming
that each sample spectrum can be written as an n-dimensional vector $s \in \mathbb{R}^n$. (For the moment, we will be deliberately
vague about what n represents; we will return to this question in Sec. 2.4.) If the sample contains the k ‘pure’ materials
$s_1, \ldots, s_k$, then the LMM assumption is that we can write s as the sum

$$ s = \sum_{i=1}^{k} \alpha_i s_i + \eta \qquad (1) $$

where $s_i \in \mathbb{R}^n$ are the vectors corresponding to the individual components, $\eta \in \mathbb{R}^n$ is a noise / modeling error term,
and $\alpha_i \in \mathbb{R}$ are scalars. Intuitively, the $\alpha_i$ represent the ‘fractional amount’ of each material present in the sample, and
are sometimes known as the abundance coefficients. Note that Eq. 1 can be written more succinctly in matrix form as

$$ s = M\alpha + \eta $$

where M is the n-by-k matrix whose columns are the vectors $s_i$, and $\alpha = (\alpha_1, \ldots, \alpha_k)^T$ contains the abundances. With
this notation, it can be shown that the least-squares optimal estimate of the abundance coefficients is given
by the pseudo-inverse:

$$ \hat{\alpha} = (M^T M)^{-1} M^T s \qquad (2) $$
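
The unmixing step of Eq. 2 can be sketched in a few lines of NumPy. This is a minimal illustration on synthetic data, not the authors' code: the band count, the random "pure" spectra, the abundances, and the noise level are all assumed values, and `unmix` is a hypothetical helper name. The sketch uses `np.linalg.lstsq`, which gives the same least-squares solution as the explicit pseudo-inverse when M has full column rank.

```python
import numpy as np


def unmix(M: np.ndarray, s: np.ndarray) -> np.ndarray:
    """Least-squares abundance estimate for s ~ M @ alpha.

    Equivalent to (M^T M)^{-1} M^T s when the columns of M are
    linearly independent.
    """
    alpha_hat, *_ = np.linalg.lstsq(M, s, rcond=None)
    return alpha_hat


rng = np.random.default_rng(0)
n, k = 200, 3                            # assumed: 200 bands, 3 pure components
M = rng.random((n, k))                   # synthetic pure spectra, one per column
alpha_true = np.array([0.5, 0.3, 0.2])   # assumed abundances
s = M @ alpha_true + 0.001 * rng.standard_normal(n)   # linear mixture plus small noise

alpha_hat = unmix(M, s)
print(np.round(alpha_hat, 3))
```

Note that this unconstrained estimate does not force the abundances to be non-negative or to sum to one; whether such constraints are appropriate depends on the application.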

2.2  Mixed  Spectra  Target  Detection 

In  the  linear  mixing  model,  it  is  assumed  that  a  mixed  sample  spectrum  can  be  decomposed  into  a  sum  of  its 
constituents.  In  the  target  detection  problem,  the  goal  is  to  decide  whether  a  given  material  (the  ‘target’)  is  present  in  a 
given sample. To that end, we begin by assuming that there exists a library of known ‘background’ spectra (that is, non-target
materials that may or may not be present in a given sample), and, following Eq. 1, write a given sample s as the
sum

$$ s = \sum_{i=1}^{m} \beta_i s_i + \alpha t + \eta $$

where $s_i \in \mathbb{R}^n$ are the background spectra, and $t \in \mathbb{R}^n$ represents the target signature. In rough terms, the target
detection problem reduces to deciding whether $\alpha = 0$ (which means the target is absent from the sample)
or $\alpha \neq 0$ (target present).

Up to this point, we have been (implicitly) assuming that each component spectrum can be modeled as a single,
unvarying vector $s_i \in \mathbb{R}^n$. In reality, different measurements of the same material will not be exactly identical, but will
vary slightly from run to run, due to (among other things) differences in lab conditions, sample preparation, etc. In
general, modeling this variability can be done either statistically (usually by assuming a priori some type of distribution)
or geometrically. We use the second approach; in particular, we assume a subspace model for the variation. More
precisely, we assume that, for each pure component, there exists a corresponding n-by-l matrix $S_i$ such that each
measured component spectrum $s_i$ can be written as $s_i = S_i \sigma_i$ for some coefficient vector $\sigma_i \in \mathbb{R}^l$. Note that $\sigma_i$ will vary among different measurements
of the same material.

In the context of target detection, this means that there exist matrices B and T representing the background and target
materials, respectively, such that every mixed sample s can be written as

$$ s = B\beta + T\alpha + \eta . $$

As above, the target detection problem reduces to deciding whether the components of $\alpha$ are zero (meaning the target is
absent) or not (target present). More formally, this can be written as a statistical hypothesis test

$$ H_0 : s = B\beta + \eta $$
$$ H_1 : s = B\beta + T\alpha + \eta $$

where the null hypothesis $H_0$ states that the sample contains only background material (target absent), while the
alternative hypothesis $H_1$ is that the target is present.

2.3  The  Adaptive  Subspace  Detector 

In target detection problems, the usual method for deciding hypothesis tests of this form is the generalized
likelihood ratio test (GLRT), in which the null hypothesis is rejected (that is, a target is deemed present) if and only if a
certain statistic is above a given threshold. For the subspace model above, and assuming that the error is distributed as
$\eta \sim N(0, \sigma^2 I)$, it can be shown that the GLRT statistic is given by

$$ D(s) = \frac{s^T (P_B^\perp - P_Z^\perp) s}{s^T P_Z^\perp s} \qquad (3) $$

Here $s \in \mathbb{R}^n$ is a measured sample, and $P_B^\perp$ and $P_Z^\perp$ are (complementary) projection operators defined as

$$ P_B^\perp = I - P_B = I - B (B^T B)^{-1} B^T $$
$$ P_Z^\perp = I - P_Z = I - Z (Z^T Z)^{-1} Z^T $$

where B is the matrix of background signatures, and $Z = [T \; B]$ is the joint matrix of target and background
signatures. The detector D is generally known as the adaptive subspace detector (ASD) in the signal and hyperspectral
image processing community, and as the F-test in statistics.
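
The following sketch shows one way the detector could be computed for a single spectrum. It assumes the commonly cited form of the statistic, the ratio of $s^T (P_B^\perp - P_Z^\perp) s$ to $s^T P_Z^\perp s$ with $Z = [T \; B]$, and omits any constant scaling factor. The matrices, dimensions, and noise level are synthetic assumptions, and the function names are hypothetical.

```python
import numpy as np


def complement_projector(A: np.ndarray) -> np.ndarray:
    """Projector onto the orthogonal complement of the column space of A.

    A @ pinv(A) is the projector onto col(A); for full-column-rank A it
    equals A (A^T A)^{-1} A^T.
    """
    return np.eye(A.shape[0]) - A @ np.linalg.pinv(A)


def asd_score(s: np.ndarray, T: np.ndarray, B: np.ndarray) -> float:
    """Assumed ASD statistic: energy explained by the target subspace beyond
    the background, relative to the residual left by target plus background."""
    Z = np.hstack([T, B])
    P_B_perp = complement_projector(B)
    P_Z_perp = complement_projector(Z)
    numerator = s @ (P_B_perp - P_Z_perp) @ s
    denominator = s @ P_Z_perp @ s
    return float(numerator / denominator)


rng = np.random.default_rng(0)
n = 50                                   # assumed number of bands
B = rng.standard_normal((n, 3))          # synthetic background subspace (3 columns)
T = rng.standard_normal((n, 2))          # synthetic target subspace (2 columns)

s_bg = B @ np.array([1.0, 2.0, -1.0]) + 0.1 * rng.standard_normal(n)   # background only
s_tgt = s_bg + T @ np.array([1.0, 0.5])                                # target added

score_bg = asd_score(s_bg, T, B)
score_tgt = asd_score(s_tgt, T, B)
print(score_bg, score_tgt)
```

In this synthetic setting the score for the target-bearing spectrum should be far larger than the background-only score; in practice a threshold on the score would have to be chosen from data.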


Fig.  2.  Representation  of  the  ASD  algorithm.  Each  target  spectrum  and  background  spectra  are  used  to  define 
subspaces.  Test  cases  are  then  projected  into  each  subspace,  and  the  distance  from  the  subspace  is  calculated. 


2.4 SWOrRD ASD Target Detection

In order to implement the ASD target detector of Eq. 3, it is necessary to construct the target and background subspaces
introduced in the previous subsections. Implicit in this construction is the choice of how to represent the spectra as vectors in some
n-dimensional space. Recall that a SWOrRD spectrum is modeled as an n-by-p matrix, where p is the number of laser
runs in the sample, and n is the number of bands (or wavenumbers) for each run. One way to model this would be as a single
vector of length $n \times p$; however, this implies a tremendous number of dimensions (on the order of $10^6$) for each vector. Such high
dimensionality has a number of unwanted consequences; for this reason, we use a different representation.

In geometrical terms, the ASD algorithm decomposes the space containing the data into target and background subspaces
defined by the matrices B and T. Our goal is to define these subspaces; our approach is as follows: let s be a given 2d
SWOrRD spectrum, whose columns $s_i$ are n-dimensional vectors, each corresponding to a single laser run (that is, a typical 1d
Raman spectrum, taken at a certain laser wavelength). The SWOrRD spectrum thus defines a set of points in
n-dimensional wavenumber space; this set of points will span a relatively low-dimensional subspace (that can be determined
via singular value decomposition, or SVD). We take subspaces of this kind to be the target / background subspaces needed in
ASD.

To be more precise, suppose we are interested in detecting a certain material whose SWOrRD spectrum is t against a
background that is composed of m SWOrRD spectra $b_1, \ldots, b_m$. We begin by calculating a ‘target space’ by
modeling the individual runs of t as points in $\mathbb{R}^n$; we then use a standard SVD analysis to determine the
dimensionality and basis vectors for the subspace containing these points. The basis vectors are then used to construct
the n-by-k matrix T of the subspace model in Sec. 2.2 (here k is the estimated dimensionality of the target space). Similarly, we model each run
of each background spectrum $b_i$ as a point in $\mathbb{R}^n$; the set of all (m x p) points $b_{1,1}, b_{1,2}, \ldots, b_{1,p}, b_{2,1}, \ldots, b_{m,p}$ is again
modeled via SVD to define a background matrix B. To run the detector against a sample s, we use the constructed
matrices to apply Eq. 3 to each of the individual runs $s_i$ of the sample s; each run outputs a scalar $d_i$ that represents a
‘target score’ for that laser wavelength. To compute a final score D, we simply compute the norm of the individual
outputs

$$ D = \sqrt{d_1^2 + \cdots + d_p^2} . $$
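
The procedure of this subsection can be sketched end to end as follows. This is an illustration under stated assumptions rather than the authors' implementation: the subspace dimension is chosen by a cumulative singular-value energy threshold (the text does not say how the dimensionality is estimated), the per-run statistic uses the ratio form of the ASD without constant scaling, and all spectra are synthetic low-rank matrices. The names `subspace_basis`, `asd_score`, and `sworrd_score` are hypothetical.

```python
import numpy as np


def subspace_basis(runs: np.ndarray, energy: float = 0.999) -> np.ndarray:
    """Orthonormal basis for the span of the columns of `runs` (one column per run).

    k is the smallest number of left singular vectors whose squared singular
    values capture `energy` of the total (an assumed selection rule).
    """
    U, sv, _ = np.linalg.svd(runs, full_matrices=False)
    cumulative = np.cumsum(sv**2) / np.sum(sv**2)
    k = int(np.searchsorted(cumulative, energy)) + 1
    return U[:, :k]


def asd_score(s: np.ndarray, T: np.ndarray, B: np.ndarray) -> float:
    """Assumed ASD ratio statistic for a single run s."""
    Z = np.hstack([T, B])
    eye = np.eye(s.shape[0])
    P_B_perp = eye - B @ np.linalg.pinv(B)
    P_Z_perp = eye - Z @ np.linalg.pinv(Z)
    return float(s @ (P_B_perp - P_Z_perp) @ s) / float(s @ P_Z_perp @ s)


def sworrd_score(sample, target, backgrounds, energy=0.999) -> float:
    """Score each run (column) of `sample`, then combine the scores by their 2-norm."""
    T = subspace_basis(target, energy)
    B = subspace_basis(np.hstack(backgrounds), energy)   # all m x p background runs
    d = np.array([asd_score(sample[:, j], T, B) for j in range(sample.shape[1])])
    return float(np.linalg.norm(d))


# Synthetic library: three materials, each a rank-2 n-by-p spectrum (assumed sizes).
rng = np.random.default_rng(1)
n, p = 60, 8
Q, _ = np.linalg.qr(rng.standard_normal((n, 6)))
t = Q[:, 0:2] @ rng.standard_normal((2, p))
b1 = Q[:, 2:4] @ rng.standard_normal((2, p))
b2 = Q[:, 4:6] @ rng.standard_normal((2, p))

present = 0.6 * b1 + 0.4 * t + 0.01 * rng.standard_normal((n, p))
absent = 0.6 * b1 + 0.4 * b2 + 0.01 * rng.standard_normal((n, p))

D_present = sworrd_score(present, t, [b1, b2])
D_absent = sworrd_score(absent, t, [b1, b2])
print(D_present, D_absent)
```

Working with p vectors of length n keeps every matrix at n rows, which avoids the very high dimensionality of the flattened representation discussed above.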

3.  EXPERIMENTAL  RESULTS 

To test our algorithms, we began with a set of 5 liquid chemicals (acetonitrile, ethanol, methanol, ethylene glycol, and
water), which were measured by SWOrRD and used as our pure elements. We then created 14 different combinations of
these chemicals, which were generally measured 3 or 4 times each, for a total of 68 samples. We also used a ‘dummy’
chemical (cyclohexane) that was included in our library but not in any of the mixtures; this was meant to test whether
any ‘false positives’ would arise. An example of one of the mixture spectra is shown in Fig. 3. The composition of the
mixtures is summarized in Table 1. We note that each mixture consisted of equal volumes of each chemical present; using
the molecular weights, we were then able to calculate the ‘amount’ of each chemical (as measured by fractional molecular volume).

Fig. 3. SWOrRD spectrum of a liquid mixture of 4 chemicals (ethanol, methanol, ethylene glycol, and water).


Our first experiment was to test whether the linear mixing model was appropriate for Raman spectral mixtures. In
particular, we used the measured pure chemicals as the components in Eq. 1 above, and then ‘unmixed’ each mixture, that is, calculated its
abundances, using Eq. 2. The results are summarized in Fig. 4 below. Each chart shows the actual
amount on the x-axis, and the estimated abundance on the y-axis; ideally, the samples would lie on the 1:1 line. As can
be seen from the figures, the linear mixing model estimates the actual
concentrations reasonably well, with the exception of water. We believe this is due to the fact that water has only very broad, poorly
defined Raman features, so that it is difficult to ‘line up’ the peak between different samples.


Our second experiment was to run the ASD detector described in Sec. 2.4 above against each of the samples. In the
experiment, each individual chemical was defined as the ‘target’, and the remaining 5 chemicals (including cyclohexane)
were assumed to be background; as a result, 6 different detectors were defined. The results are summarized in Fig. 5
below. Each figure shows the associated ASD score for the given chemical ‘target’ for each of the mixtures. For ease of
interpretation, the scores are color-coded as follows: the green scores correspond to the pure samples, the blue scores are
mixed samples which contain the associated target, and the red scores are samples which do not contain the target. It is
easily seen that the ASD detector is able to find the target in each of the correct samples, and gives very low scores when
the target is absent.

Table 1. Composition of the 14 mixtures in the experiment. Each mixture contains equal volumes of each chemical.


Fig. 4. Actual vs. estimated amounts of each chemical in the mixtures. Clockwise from top left: acetonitrile, ethanol,
ethylene glycol, water, and methanol.


Fig. 5. ASD detector scores for each chemical and each sample, plotted against sample number. Top row: acetonitrile, cyclohexane. Middle row:
ethylene glycol, ethanol. Bottom row: methanol, water.